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1 . [ntroduction 

Finite  source  queueing  models  (machine  repair  models)  (see  [A], 
pp.  118-125)  can  be  useful  for  a variety  of  applications.  One  area  of 
application  is  in  spares  provisioning,  where,  for  a given  population  of 
operating  units  (for  example,  aircraft  engines,  computer  control  modules, 
or  lathes)  which  stochastically  fall  and  are  repairable,  it  is  desired  to 
know  how  many  spare  units  to  have  on  hand  as  well  as  the  capacity  of  the 
repair  facility  in  order  to  guarantee  a certain  service  level.  Gross, 

Kahn,  and  Marsh  f5]  treat  such  an  application  in  detail. 

In  the  Gross  et  al  study  [5],  where  it  was  desired  to  determine 
the  spares  inventory  and  repair  capacity  over  a changing  multi-year  planning 
liorizon,  three  key  assumptions  were  necessary  in  order  to  apply  the  classi- 
cal queueing  theoretic  solutions.  These  assumptions  are  (1)  all  units  in 
tile  population  are  identical,  (ii)  units  and  repair  facilities  operate 
continuously,  and  (ill)  the  system  is  always  in  steady  state.  In  addition, 
units  .ire  assumc-d  to  fall  according  to  a Poisson  process  and  repair  times 
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are  assumed  to  be  exponentially  distributed  random  variables.  Since  these 
latter  two  assumptions  were  considered  to  be  fairly  realistic  in  a variety 
of  situations,  it  is  the  three  assumptions  enumerated  above  which  are 
Investigated  here. 

2.  Populations  with  Unequal  Failure  Rates 

When  applying  the  classic  machine  repair  theory,  one  must  assume 
all  units  are  identical  in  order  to  obtain  the  steady-state  system  size 
probabilities  required  for  determining  the  system  service  level  for  a 
given  number  of  spares  and  repair  channels.  The  question  naturally  arises, 
is  this  an  adequate  assumption?  Steady-state  probabilities  obtained  by 
making  this  simplifying  assumption  approximate  the  true  probabilities. 

It  is  conceptually  possible  to  model  a finite  source  queue  with  spares 
having  different  failure  rates  for  the  calling  units,  but  for  a system  of 
any  reasonable  size  the  equations  become  unwieldy. 

In  this  section  we  study  a finite  source  machine  repair  model 
having  one  machine  and  one  spare.  Here  we  allow  the  calling  units  to  have 
different  failure  rates.  In  particular,  the  effect  of  approximating  the 
steady-state  probabilities  by  assuming  that  all  units  have  failure  rates 
equal  to  the  average  failure  rate  of  the  calling  units  is  investigated. 

In  general,  letting  M denote  the  population  size,  y the  number 
of  spares,  and  c the  number  of  repair  channels  at  the  repair  facility, 
the  mode]  we  work  out  in  detail  here  is  for  M = y = c = 1 . Unit  1 has 
a Poisson  failure  rate  and  Unit  2 . It  is  assumed  that  service 

times  are  independent  exponentially  distributed  random  variables  with 
mean  1/p  , and  that  if  Unit  i (i=l,2)  is  operating  at  time  t , the 

is  X^At  + o(At)  . The 
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probability  it  will  have  failed  by  time  t + At 
following  states  may  be  defined  for  the  system. 
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State 

Unit  1 

Unit  2 

0 

Operating 

Spare 

1 

Operating 

Repair 

2 

Spare 

Operating 

3 

Repair 

Operating 

4 

Queue 

Repair 

5 

Repair 

Queue 

The  steady-state  probabilities  must  satisfy  the  following  balance  equations: 


\P0  = ^‘Pl 


(X^+vi)p^  = + yp^ 


^2P2  " PP3 


(X2+y)P3  = yp^  + X3P0 


(1) 


PP4  = \Pl 


yP5  = X2P3 


5 

I P-i 


1 , 


i=0 


where  p^  is  the  steady-state  probability  that  the  system  is  in  state  i , 


i=0,l,...,5  . Solution  of  the  system  of  Equations  (1)  yields 


Pi  y Pq  ’ 


P4  = Pq  ’ 


Po  = 


A^(A^+y) 


\^2 


2 X2(X2+y)  *^0  ’ 


^5"  .2  Ix.+yjPo 


X^(Xi+y) 


Pj  y(X2+y)  Pq  ’ 


(2) 


Pq  = "J 


y X2(X2+y) 


y'’(Xi+X2)  + y^CX^+X^)^  + 2X^X2(yX2+uX^+X^X2) 
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1 

I 

I 

I 

The  measure  of  system  performance  that  we  use  here  is  the  steady-  j 

state  probability  that  a request  for  a spare  unit  is  met  without  delay,  j 

and  is  equivalent  to  the  steady-state  probability  that  a failed  unit  en-  j 

counters  less  than  y units  in  repair  or  waiting  for  repair.  The  prob-  ; 

ability  that  a failed  unit  finds  the  system  in  a particular  state  is  dif- 
ferent from  the  arbitrary  time  probabilities  given  by  Equation  Set  (2). 

In  common  queueing  terminology  these  are  termed  "arriving  customer"  prob- 
abilities, but  in  this  paper  they  will  be  designated  "failure  point" 
probabilities  since  they  correspond  to  the  occurrence  of  component  failures. 

For  a finite  source-no  spares  queue  (see,  e.g..  Cooper  [3],  pp.  82  ff),  the 
"failure  point"  probabilities  are  equivalent  to  the  general  time  prob- 
abilities for  a finite  calling  population  of  one  less,  but  this  relation- 
ship does  not  hold  for  the  spares  case. 

The  appropriate  failure  point  probabilities  denoted  by  q^  , as 

contrasted  to  the  general  time  probabilities  denoted  by  p , may  be  de- 
ived  as  follows. 

Using  Bayes'  Theorem, 

^n  ~ Pr{system  is  in  state  n| failure  about  to  occur} 

Pr{system  is  in  state  n}Pr{failure  about  to  occur [system  in  state  n} 

^ Pr{system  is  in  state  n}Pr{failure  about  to  occur [system  in  state  n) 
n 

be  modeled  as  a birth-death  process 

= A At  + o(At)  , 
n 

n=0,l 
n=2, 3 
n=4,5 

Thus,  for  n=0  , 

PQ[Xj^At+o(At)] 

'’O  " (Po+P^)IA^At+o(At))  + (p2+P3)[A2At+o(At)]  ’ 
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The  queueing  system  considered  here  may 
with 


where 


Pr{failure  in  t,t+At} 


Af  , 

A 2 f 


0 , 


J 


L.~  
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Dividing  the  numerator  and  denominator  by  At  and  taking  the  limit  yields 


In  = 


\Po 


0 Aj^(Pq+p^)  + A2(p2+P3)  ’ 

where  the  p^  are  given  by  Equation  Set  (2).  Similarly, 

AiPi 


1 Ai(Po+Pi)  + A2(P2+P3) 


^ ^2^2 


= 


A2P3 


3 ^2^P2'^3^ 

Tlie  availability,  or  the  probability  that  a failed  unit  encounters  less 
than  y=l  units  in  the  system,  is  therefore 

A = qg  + q2 


lj(A^+A2+2y) 


(3) 


2(p  +A^y+A2y+A^A2) 


For  purposes  of  comparison,  consider  the  M=1  , y=l  , c=l  queue 
with  exponential  service,  mean  1/y  . It  is  assumed  that  the  two  units 
in  the  system  have  failure  rates  A^  and  both  units  oper- 

ate with  the  average  failure  rate  (A^+A2)/2  . Thus  the  probability  that 

a unit  operating  at  time  t fails  by  time  t + At  is  assumed  to  be 
[ (Aj^+A2) /21At  + o(At)  . The  approximate  general  time  probabilities  are 

then  (see  [4],  p.  123), 


Pi  = 


Ai  + A^ 


2y 


^0  ’ 


P2  \ 2y  / Pq  ’ 
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Pr^  = 


+ y(X^+X2)/2  + /l, 


where 


p.  = approximate  steady-state  probability 
^ of  i units  in  the  system,  i“0,l,2  ; 


and  the  approximate  "failure  point"  probabilities  are 


Po(V^2^/^ 


% Pq(X^+X2)/2  + p^(X^+X2)/2  ’ 


P^(X^+X2)/2 


'll  ° Pq(X^+X2)/2  + p^(X^+X2)/2  * 


where  the  p^  are  defined  above  and 


q . = approximate  probability  that  a failed 
unit  finds  i units  in  the  system. 


i=0,l 


It  follows  that  the  availability  for  this  system,  or  the  failure  point 
probability  of  zero  units  in  the  system,  assuming  an  average  failure  rate 
for  both  units,  is 


A = 


or 


A = 


2il 


2y  + X^  + X2 


(4) 


The  approximate  availability  given  by  A always  underestimates 
the  exact  availability  A . That  is, 

A > A , 


with  equality  holding  only  when  “ ^2  ’ shown  as  follows; 


rewriting  Equations  (3)  and  (4)  in  terms  of  * X^/y  and  P2  = X2/y  yields 


2 + Pi  * Pj 


2(l-fPi+Pi+PiP2) 


2 + Pj^  + p2 
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It  is  easy  to  see  that  A ^ A since 

(Pl-P2)^ 

^ ^ " 2(l+Pj^+P2+PiP2^^^'^Pl'^2^  ~ ° 

The  expression  (5)  is  zero  only  when  p^  = or,  equivalently,  when 
Aj^  = ^2  . Therefore,  the  ap' roxlmate  A is  a lower  bound  on  the  exact 
availability. 

Some  calculations  for  A and  A are  shown  in  Table  I for  differ- 
ent values  of  and  • In  most  cases  the  percent  difference  between 

A and  K is  small  even  though  there  is  a large  difference  between  p^ 
and  p^  . The  greatest  difference  is  27.22%  for  p^  = .1  and  p^  = 2.5  . 
For  cases  where  both  p^  and  p^  are  less  than  one,  the  largest  differ- 
ence is  7.11%  for  p^  = .1  and  P2  ~ • These  results  suggest  that  A 

is  a reasonable  approximation  for  the  true  availability,  particularly  as 
the  differences  in  the  failure  rates  will  probably  be  small  for  most  prac- 
tical cases. 

It  seems  reasonable  to  assume  that  the  approximate  availability, 
based  on  using  the  arithmetic  mean  of  the  failure  rates,  will  always  be 
a lower  bound  on  the  actual  availability  for  systems  involving  larger  num- 
bers of  calling  units  and  more  than  two  failure  rates.  This  is  intuitively 
appealing  since  it  would  be  expected  that,  with  unequal  failure  rates,  the 
less  reliable  units  are  "down"  (i.e.,  in  the  service  facility)  more  fre- 
quently than  the  more  reliable  units.  Thus  the  units  that  are  "up"  at  a 
given  time  tend  to  be  better  than  average,  so  that  the  effective  aggregate 
failure  rate  for  the  units  in  the  system  is  less  than  the  arithmetic 
average  of  the  failure  rates  of  the  units. 

'i.  Noncontinuous  Operation 

Classical  finite  source  queueing  theory  assumes  the  units  in  the 
operating  population  operate  continuously  (along  with  the  repair  facilities/ 
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COMPARISON  OF  ACTUAL  AND  APPROXIMATE  AVAILABILITY  CALCULATIONS 


T-360 
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repalrmea).  In  practice,  this  often  Is  not  the  case.  For  exairple.  If 
we  are  concerned  with  provisioning  spares  for  engines  belonging  to  a fleet 
of  aircraft,  not  all  aircraft  are  always  operating.  Thus  any  engine  (pop- 
ulation component)  may  operate  only  a fraction  of  the  time,  say  a . 

There  are  two  possible  ways  to  account  for  this.  One  way  is  to 
decrease  the  failure  rate  appropriately;  that  is,  if  X is  the  actual 
failure  rate  for  continuously  operating  units  and  each  unit  operates,  on 
the  average,  a fraction  a , then  we  use  an  adjusted  failure  rate  X'  = aX  . 
For  example,  if  a unit  has  a mean  time  between  failures  (MTBF  = 1/X)  of 
1000  hours  and  operates  only  half  the  time  on  the  average,  we  would  assume 
its  MTBF  to  be  2000  hours. 

A second  way  to  account  for  noncont inuous  operations  is  to  adju,. 
the  population  size;  that  is,  use  an  effective  population  size  of  M'  = aM  . 
In  many  cases  this  tmiy  actually  be  closer  to  reality,  as  exemplified 
by  the  aircraft  engine  problem.  In  reality,  the  situation  more  closely 
resembles  a population  of  size  oM  where  each  unit  operates  at  full  failure 
rate  X , rather  than  a population  of  M where  each  unit  operates  at  a 
reduced  failure  rate  aX  . 

Wiien  applied  to  a fleet  of  gas-turbine  engine  ships  where  provi- 
sioning for  the  gas  generator  component  of  the  engine  is  desired,  botJi  ad- 
justment procedures  were  compared  based  on  absolute  and  percentage  differ- 
ences in  availability  (this  example  is  detailed  in  Gross,  et  al  [5]).  The 
results  given  in  Table  II  show  that  there  is  very  little  difference  between 
tile  methods,  even  for  small  M , a , and  low  availabilities.  Even  though 
the  population  size  adjustment  may  more  closely  reflect  the  real  situation, 
the  advantage  to  adjusting  the  failure  rate  is  that  no  round  off  difficulties 
are  met,  since  failure  rate  is  a continuous  quantity  while  population  size 
is  discrete. 

4.  Transient  Effects 

In  order  to  account  for  population  and  reliability  growth  over  a 
muiti-year  planning  horizon  while  utilizing  classical  steady-state  theory. 
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TABLE  II 


ADJUSTMENT  OF  POPULATION  SIZE  VERSUS  ADJUSTMENT  OF  FAILURE  RATE 


M 

a 

c 

y 

A(aX) 

A(aM) 

A 

%A 

10 

.8 

3 

3 

.955 

.956 

.001 

0.04 

2 

2 

.802 

.805 

.003 

0.36 

.6 

2 

2 

.878 

.881 

.003 

0.37 

2 

2 

.940 

.942 

.002 

0.22 

1 

1 

.637 

.660 

.023 

3.51 

.2 

1 

1 

.813 

.827 

.014 

1.69 

50 

.8 

8 

8 

.971 

.972 

.001 

0.02 

6 

7 

.904 

.905 

.001 

0.12 

5 

5 

.655 

.661 

.006 

0.81 

.6 

5 

5 

.855 

.858 

.003 

0.36 

4 

3 

.473 

.485 

.012 

2.51 

.4 

3 

5 

.895 

.901 

.006 

0.72 

4 

4 

.886 

.890 

.004 

0.39 

3 

3 

.683 

.698 

.015 

2.20 

182 

.8 

10 

10 

.849 

.847 

-.002 

-0.15 

8 

11 

.757 

.756 

-.001 

-0.05 

11 

8 

.667 

.665 

-.002 

-0.24 

.6 

6 

9 

.727 

.740 

.043 

1.78 

7 

7 

.711 

.718 

.007 

0.92 

9 

6 

.633 

.637 

.004 

0.64 

.4 

5 

10 

.965 

.966 

.001 

0.05 

7 

7 

.945 

.945 

.000 

0.00 

10 

5 

.771 

.772 

.001 

0.04 

5 

5 

.702 

.707 

.005 

0.75 

7 

4 

.589 

.591 

.002 

0.33 

5 

3 

.339 

.346 

.005 

2.02 
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it  was  necessary  to  assume  the  population  reached  new  steady-state  levels 
instantaneously  each  year  in  the  planning  horizon.  For  this  approximation 
to  be  valid,  the  transient  effects  must  be  small  after  a relatively  short 
period,  say  three  months. 

Again,  for  a small  size  model  (M  = y = c = 1)  we  can  analytically 
work  out  transient  solutions  in  closed  form.  The  derivation  follows. 


Assuming  both  units  fail  at  Poisson  rate  X and  are  repaired  ex- 
ponentially at  mean  rate  y , the  dlf fc..ential  difference  equations  result- 
ing from  a birth-death  analysis  are 

PgCt)  = -^PQ(t)  + yPj^(t) 

pj^(t)  = -(A+y)p^(t)  + XpgCt)  + yp2(t)  (6) 

P2(t)  = -yp2(t)  + Xp^(t)  . 


Taking  Laplace  transforms  (LT) , we  get 

sPq(s)  - Pq(0)  = -APq(s)  + ypj^(s) 

sp^(s)  - p^(0)  = -(A+y)p^(s)  + APq(s)  + yp2(s) 

sp2(s)  - P2(0)  = -yp2(s)  + Ap^(s)  , 

or  tlie  following  three  equations  in  three  unknowns  to  be  solved, 
(s+A)Pq(s)  - mPqCs)  + OPq(s)  = Pq(0) 

-Apq(s)  + (s+A+y)p^(s)  - yp2(s)  = Pj^(O)  (7) 

OPq(s)  - Ap^(s)  + (s+y)p2(s)  = P2(0)  . 


To  solve  these,  we  can  use  Cramer's  rule.  Thus  it  is  first  necessary  to 
evaluate  the  determinant  of  the  denominator, 


D = 


s+A  -y  0 

-A  s+A+y  -y 
0 -A  s+y 

(s+A) (s+A+y) (s+y)  - Ay(s+A)  - Ay(s+y)  . 
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Expanding  and  cancelling  yields 

3 2 2 2 2 

D = s +2sX+2sy+  sXy  + sX  + sp 

= sCs^  -f-  2sX  -f-  2sp  + Ap  -f  X^  + p^)  . 

In  order  to  invert  the  LT,  we  would  like  D to  be  of  the  form 

D = (s+a) (s+b) (s+c)  , 

which  we  obtain  by  letting 

a = 0 

b = X + p + *^p 
c = X + p - i/Xp  ; 
that  is,  (8)  is  equivalent  to 

D = s(s+X+p  + Ap)(s  + X + p - Ap)  , 
which  can  be  verified  by  expanding  and  cancelling. 


(8) 


(9) 


Now  if  we  use  the  initial  condition  Pg(0)  = 1 » Pj(0)  = p^CO)  = 0 , 

that  is,  if  all  units  are  "up,"  the  right-hand  side  of  (7)  becomes  1,0,0 
and 


P2(s) 


s+X  -p  1 

-X  s+X+p  0 
0 -X  0 


D 


D 


(10) 


From  a table  of  LT  inverses  (see,  for  example,  Abromowitz  and  Stegun  [1]), 

1 


f(s)  = 


(s+a) (s+b) (s+c)  ’ 


(a  ^ b ^ c) 


yie] ds 


fft')  = - + (c-a)e  + (a-b)e 

(a-b) (b-c) (c-a) 


so  that 


, - A^[2/Xi;  + (X+p-AiI)e~(^-^^^^^'^  - (X+p+AI.)e-(^''^-^>M 

2'  ' . 2 , 2 


and 


2Ap(X^+Xp+p^) 
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s+A  1 0 

-A  0 -y 


, . _ I 0 0 s+y|  _ A(s+y)  . sA  yA 


D D D 


Using  (9)  Cor  D and  (10)  gives 


Pi (s)  = - 


(s+A+y+i^'Xy)  (s+A+y-  /Xy) 


+ A P9<s)  • 


Inversion  gives  [f  = l/(s+a)(s+b)  — > f = -g—  (e  - e )]  , 

p (t,  . . ^-<x+iJ+AS)t1  ^ i (j,  _ 

^ 2/AiI  L J A ^ 

Rather  than  solving  for  Pq(s)  inverting,  it  is  easier  to  use 

PQ(t)  = 1 - p^Ct)  - pj^(t)  . 


Now  since  availability  at  time  t is  the  conditional  probability 
that  a failing  unit  finds  the  repair  system  empty,  that  is,  the  other  unit 
"up,"  we  have 


Po(t)  + Pj^(t) 


1 - P2(t)  - Pj^Ct) 
1 - P2(t) 


Results  can  be  obtained  for  an  initial  condition  with  both  units  "down"  in 
a similar  fashion  and  are  given  by 

. , ^ u^[2A^+  (A+y-AII)e~^^'^P~^'^^*^  - (A+y+A^)e~^^'^P~*^^M 
^ 2Ay(A^+Ay+y^) 


^ 4.  P r -(^+y-*^)t  -(A+y+Ay)t' 

P,<o  .-Po(t) + — U 


These  results  were  used  to  produce  Figures  1 and  2.  Figure  1 shows 
convergence  to  steady  state  for  two  cases  yielding  the  same  steady-state 
availability,  one  with  large  A and  y and  the  other  with  small  A and 
y . ' We  note  that  the  large  A,y  case  converges  much  faster,  although  both 
converge  quite  quickly  for  the  "both  up"  initial  condition.  This  Illustrates 


A.y  arc  given  in  units  per  day. 
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the  two  factors  affecting  convergence  to  steady  state,  which  are  (i)  the 
role  of  decay  of  the  exponential  term  and  (ii)  how  far  the  initial  avail- 
ability is  from  the  steady-state  value.  For  Condition  (i)  we  desire  X 

and  p large  since  the  transient  terms  are  e t ^ ^^hile  for 

Condition  (ii)  A(0)  =1  is  much  closer  to  A(“)  = .9  than  is  A(0)  = 0 . 

Figure  2 shows  isopercent  error  at  t=3  months  on  the  A-p  plane. 
The  shapes  of  the  curves  also  illustrate  the  two  effects  described  above, 
namely,  small  percentage  error  for  X and  p both  large,  and  the  effect 
of  Initial  condition  closeness  to  steady  state,  since  each  point  on  an  iso- 
percent error  curve  has  a different  AC®)  = p/(X+p)  . The  dashed  line 
shows  all  X-p  combinations  for  A(®)  = .9  and  we  see  the  percentage 
errors  even  for  small  X,p  , when  A(0)  = 1 , are  small. 

To  obtain  similar  results  for  larger  population  sizes  is  alge- 
braically untenable.  For  transient  analysis,  unlike  the  unequal  failure 
rate  problem,  making  inferences  based  on  the  small  model  is  not  intuitively 
as  comfortable.  Thus  we  take  a different  approach  to  transient  analysis 
for  large  populations  — that  of  the  imbedded  Markov  chain. 

The  analysis  of  the  convergence  to  steady  state  in  Markovian  queues 
can  be  carried  out  either  through  a numerical  procedure,  as  described  by 
Cohen  [2],  or  through  an  Imbedded  Markov  chain  approach,  in  which  the 
system  is  observed  upon  failures  of  machines.  We  preferred  the  imbedded 
Markov  chain  approach  here  because  we  are  interested  in  the  state  of  the 
system  upon  failures  of  machines,  and  because  by  this  procedure  we  are 
able  to  obtain  some  results  which  we  cannot  obtain  using  the  other  approach. 

Let  X and  p denote  the  arrival  and  service  rates,  respectively. 


n 

when  there  are 

n 

n machines  in 

the  system. 

Thus, 

1 

X = . 
n 

[ MX 

[M  - 

(n-y)]X  , 

n £ y-1 

y < n < M4y 

and 

1 

( 0 

y 

otherwise , 
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ny  , 

0 < n < c 

cy  , 

c < n . 

Denote  by  N(in)  the  state  of  the  system  upon  the  rath  failure, 
where  {N(m)=n}  , n=0,l,2,...  , means  that  the  mth  failure  sees  n 
failed  machines  in  the  system.  Let 


p = P[N(m)=j  1 N(m-l)=n]  , 
J 


and  denote  by  A and  S the  inter failure  time  and  the  time  to  the 
n n 

completion  of  the  first  service,  respectively,  given  that  {N(m)=n}  . 
As  both  service  times  and  interarrival  times  are  exponential,  we  note 
that  for  j < n , 


n,j 


Pr{n-j+l  service  completions  before  the 
next  arrival  ] system  is  in  state  n} 


= Pr{l  service  completion  before  the  next 
arrival  | system  is  in  state  n}  • 
Pr{n-j  service  completions  before  the 
next  arrival  | system  is  in  state  n-1 } 


/ P[A  >t]  dF  (t)  p 
t=0  ^n 


For  j = n+1  , there  must  be  an  arrival  before  a service  completion. 
Til  us  we  have, 

00 

/ P[A  >t]dF  (t)p  , j=l,2,...,n  , n=0,l,2,...,M+y-2 

t=0  " n 

<» 

/ P[S  >t]dF  (t) 

t=0  " 

0 , j > n+1  . 

The  probabilities  p ^ . , j=l , . . . ,M+y-l  , that  were  not  specified  in 

riTy—  i > 3 

(12),  are  equal  to  p „ . since  at  least  one  completion  of  service 

M+y-2,j 

■should  take  place  before  any  arrival  can  occur.  We  use  now  the  fact  that 

the  parameters  of  A and  S are  X and  U , respectively,  and 

n n n+1  n+1  ' ^ ' 

obtain  from  (12)  that 


j=n+l  , n=0,l,2, . . . ,^t+y-2 


(12) 


i 

) 
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n+1 


n,n+l 


n=0,l, . . . ,M+y-2 


(13) 


Expression  (13)  can  now  be  used  to  establish  that,  for  j=n  in  (12), 


n+1 


n,n  y + A , X + y 
n+1  n+1  n ^n 


Applying  the  recursive  relation  in  (12)  for  j n yields 

n+1  y . 


n 


A. 

j- 


i=j+i  ’ 


3“0,l,.*«,n  • 


(14) 


Now  let  N(0)  denote  the  initial  state  of  the  system  and  let 


q (i)  = P[N(m)  = i]  , m=0,l,2 1=0 , 1 , . . . ,M+y-l 

m 


Denoting  by 


we  obtain 


q'  = (q„(o)>q„(i).---.q„(f^“i))  > 

m ' ra  m m 


^;;r+i  = • 


(15) 


where  P is  the  transition  matrix  {p.  . It  is  well  known  then  that 

i»  J 

q'  has  a limit  as  m ^ °°  , and  let  q'  = lim  q’  . Denote 
m ^ m 

m-x" 

.1 

Q (j)  = y q (i)  , ra=0,l,2,,..  , j=0,l, . . . ,M+y-l  , 

i=o 

and 

j 

Q(j)  = I q(i)  • 
i=l 

A natural  initial  situation  is,  of  course,  all  machines  up,  and 
one  would  like  to  know  how  this  situation  would  affect  tt»e  future  state 
vectors.  This  leads  us  to  the  lemma  below. 


Lemma  1 : If  Qjjj(i)  ^ Q(.i)  for  all  j=l , 2 , . . . ,M+y-l  , then 
‘),„4.|(  n i L'(  j)  for  all  i=0,l  , . . . ,M+y-l  . 


18  - 


T-360 


lienee 


Using  (15)  and  the  fact  that 


m 


q’P 


h.  . 

J 

we  obtain 


where  PL.  is  a column  vector  whose  ith  element  equals 
J 


FiCj)  = I k • 


From  (14)  we  can  see  that  P < P . so  therefore 

*^n+l,j  *^0,3 


(16) 


(17) 


(18) 


(19) 


(20) 


Thus  the  right-hand  side  of  (18)  is  nonnegative  since  the  elements  in  the 
column  vector  PL.  are  decreasing  and  the  sum  of  the  first  k elements 

of  (q'  - q')  is  always  nonnegative  and  larger  than  the  magnitude  of  the 
m 

k+lst  element  from  the  hypothesis  of  the  lemma,  Q > Q'  . 

tn 


Since  the  measure  of  the  system’s  performance  — the  availability  — 
was  defined  as 

A(m)  = Q (y-1)  , 
ni 

we  mav  conclude  from  Lemma  1 that  if  the  initial  vector  satisfies 
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then 

A(m)  > Q(y-l)  = A(«>)  , m=l,2,3,...  . (21) 

The  lemm;i  cannot  be  extended  to  include  A(m)  A(iiH-l)  . This  is  because 

X M+y-1 

A(m)  - A(m+1)  = q^(I-P)L  = q^(y-l)  — ^TTT  " I 

^y  My  k=y 

hence,  the  lemma's  conditions  can  be  met  while  Expression  (22)  is  negative. 
Thus  we  can  only  state  that  the  steady-state  availability  is  always  less 
than  the  transient  availabilities. 


The  decision  to  purchase  spares  or  to  add  servers  is  often  made 

periodically;  so  it  is  Important  to  study  the  relation  between  time  and 

number  of  arrivals.  It  is  clear  that  as  X < Mp  then  the  time  to  the 

n — 

ktl)  arrival  is  stochastically  smaller  than  an  Erlang  random  variable  with 

parameters  MX  and  k (r(MX,k))  . Furthermore,  as  the  system  stays  mainly 

in  till  states  in  which  X = MX  , it  is  obvious  that  the  use  of  r(MX,k) 

n 

as  the  distribution  of  the  time  to  the  kth  arrival  is  highly  justified. 
Nevertheless,  if  one  desires  more  accurate  results  for  the  relations  be- 
tween time  and  number  of  arrivals,  the  results  of  the  following  discus- 
sion may  be  applied. 


Tlien 


I.et  denote  the  time  to  the  kth  arrival  given  {N(0)=nf  . 


t -X  y 

P[T  (k)  < t]  = / X e P[T  (k-1)  < t-y]dv  , 
0 


> -M,y  -^,y  c-y 

PfT.(k)  • t|  = / M,  e ‘ e f ^ P[T  (k-1)  < t-y-u]dudy 

0 u=n  ^ 


1 


t -X  y -p  y 

+ / X e e P[T  (k-1)  < t-y]dy  , 
0 


(23) 


and  lor  n ■ I w<-  obtain 
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1 

We  illustrate  the  convergence  to  steady  state  with  data  from  the  I 

machine  repair  problem  given  in  Gross  et  al  [5].  In  that  study  it  is  \ 

assumed  that  all  changes  in  the  parameters  of  the  system  occur  simultane-  i 

ously  once  a year.  These  changes  include  the  addition  of  new  machines,  1 

i 

spares,  and  servers;  an  increase  in  the  service  rate;  and  a decrease  in  the  j 

failure  rates  of  all  units.  Both  the  units  added  and  the  spares  added  are 

assumed  tc  be  up.  Let 

J(i,T)  = minlk  : E[T^(k)  ^ t}  . (26) 

As  we  observe  the  system  upon  failures  of  machines,  we  desire  to  associate 
the  annual  points  of  change  in  the  system's  parameters  to  cumulative  counts 
of  failures.  We  chose  J(0,365)  to  represent  the  annual  number  of  failures. 

This  assumption  and  the  data  from  Figure  2 in  [5]  is  presented  in  Table  III. 


TABLE  III 

THE  PARAMETERS  FOR  FIGURE  3 


Year 

M 

c 

• * ' ■ " 

y 

1/m^ 

Steady 

State 

Avail- 

ability 

J(0, 365) 

Cumulative 
Count  of 
Failures 
TJ(0,365) 

75 

10 

3 

3 

0.0015 

65.0 

0.920 

5 

5 

76 

28 

5 

6 

0.0015 

62.5 

0.920 

15 

20 

77 

50 

8 

8 

0.0015 

60.0 

0.915 

26 

46 

78 

82 

10 

10 

0.0013 

57.5 

0.913 

37 

73 

79 

121 

12 

11 

0.0010 

55.0 

0.905 

46 

119 

and  U are  sliown  in  units  per  day. 


Figure  3 shows  the  availability  versus  time  for  the  data  of  Table 
ill,  assuming  an  initial  condition  of  all  units  up,  using  Equations  (13) 
and  (14)  to  calculate  tiie  p^^  and  then  performing  successive  multiplica- 
tion with  the  P nuitrix;  thus  yielding  q^'s  , Q's  and  A's  . From 
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Figure  3 we  can  see  that  when  the  number  of  calling  machines  is  small 
(and  therefore  J(0,365)  is  small)  then  the  system's  situation  is  quite 
far  from  steady  state  even  after  J(0,365)  failures.  As  the  number  of 
calling  units  increases,  the  availability  after  J(0,365)  failures  ap- 
proaches the  steady-state  availability. 


To  obtain  some  feeling  about  the  sensitivity  of  the  rate  of  conver- 
gence to  steady  state  as  a function  of  X and  v > we  calculate  the  value  of 


A(m) 


Q (y-1)  - Q(y-l) 
m 

Q(y-l) 


100.0  , 


for  N(0)  = 0 , m = J(0,90)  , M = 10  , and  c = y = 3 . Table  IV  gives 
A , J , and  selected  steady-state  availabilities  for  a variety  of  MX 
versus  cy  cases.  The  display  of  the  table  is  similar  to  that  of  Figure 
2 in  that  small  values  of  MX  are  at  the  bottom  and  small  values  of  cy 
ar-'  at  the  left.  We  can  see  the  same  type  of  effect  as  caused  by  the 
A(0)  = 1 curve  of  Figure  2,  although  the  "belly"  is  reached  for  much 
higher  MX  ; that  is,  for  fixed  cy  , the  errors  increase  up  to  a certain 
value  of  MX  and  then  start  to  decrease.  Again  we  see  the  same  two 
effects  as  in  Figure  2;  namely,  the  influence  of  the  proximity  of  the 
starting  availability  to  the  steady-state  value,  and  the  damping  effect 
of  large  MX  and  cy  on  the  transient  terms  (hence  the  "belly"  situa- 
tion). Selected  steady-state  availabilities  are  shown  which  point  out 
that  in  the  realistic  availability  range  (greater  than  0.8  , for  example) 
the  percent  errors  from  steady  state  after  three  months  are  quite  small 
unless  both  MX  and  cy  are  very  low. 
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VALUES  OF  J(0,90)  AS  A FUNCTION  OF 
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